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1. Introduction 

Measure rigidity is a branch of ergodic theory that has recently contributed to the 
solution of some fundamental problems in number theory and mathematical physics. 
Examples are proofs of quantitatiye versions of the Oppenheim conjecture jS], related 
questions on the spacings between the values of quadratic forms [U |H1 El, a proof of 
quantum unique ergodicity for certain classes of hyperbolic surfaces fB] , and an approach 
to the Littlewood conjecture on the nonexistence of multiplicatively badly approximable 
numbers [T]. 

In these lectures we discuss a few simple applications of one of the central results in 
measure rigidity: Ratner's theorem. We shall investigate the statistical properties of cer- 
tain number theoretic sequences, specifically the fractional parts of ma, m = 1, 2, 3, . . ., 
(a classical, well understood problem) and of ^ma (as recently studied by Elkies and 
McMuUen j2]). By exploiting equidistribution results on a certain homogeneous space 
r\G, we will show that the statistical properties of these sequences can exhibit sig- 
nificant deviations from those of independent random variables. The "randomness" of 
other, more generic sequences such as m?a and T^a mod 1 has been studied extensively. 
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We refer the interested reader to the review [lOj, and recommend the papers f^lEj as 
a first read. 

These notes are based on lectures presented at the Institute Henri Poincare Paris, June 
2005, and at the summer school 'Equidistribution in number theory', CRM Montreal, 
July 2005. The author gratefully acknowledges support by an EPSRC Advanced Re- 
search Fellowship. 

2. Randomness of point sequences mod 1 

Consider an infinite triangular array of numbers on the circle T = M/Z (which we 
represent as the unit interval [0, 1) with its endpoints identified), 

(2.1) ; ; ••. 

^Nl ^N2 ■ ■ ■ ^NN 



We assume that each row is ordered, i.e., ^nj < ^N{j+i), and are interested in quantifying 
statistical properties of the iVth row as iV — >■ cxd. To simplify notation we will from now 
on drop the index N, and simply write instead of ^nj- 

As we shall see later, many interesting statistical properties of a sequence on T can 
be derived from the knowledge of the number of elements in small subintervals of T. 
Let X denote the characteristic function of the interval [— i, |) C M. That is, x{^) = 1 
if — i < X < I and = otherwise. The characteristic function of the interval [xq — 
|, Xo + |) + Z C T (£ < 1) can be represented as 

' X — Xo + n 



(2.2) Xi{x)=J2x 



The sum over n makes sure Xi is periodic. The number of elements in the interval 
are therefore 

N 

(2.3) S^{i)=Y,XiiQ. 

i=i 

We will always assume that the rows in our triangular array become uniformly dis- 
tributed mod one. This means that for every Xo,£, 

(2.4) hm ^S^{i) = i, 

i.e., the proportion of elements in any given interval is asymptotic to the interval length 



""^If a sequence {^j} fails to be uniformly distributed but still has a resonable limiting density p, 
we may rescale the to obtain a uniformly distributed sequence. This is done as follows. Suppose 

for every Xo,£ limjv^oo jf^Ni^) = Jx°-i/2 P^^'^'^''^' where the integrated density N{x) = p{x')dx' 
is continuous and strictly increasing. We rescale the sequence by setting £_j := A^(Cj). Note that 
e [0,1) for E [0,1). The new sequence {^j} is indeed uniformly distributed modulo one 
(exercise). 
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The aim is now to characterize the different degrees of "randomness" of the deter- 
ministic sequence {^j} in terms of their distribution in very small intervals with random 
center xq. A convenient length scale is the average spacing between elements, which is 
1/N. We set 

(2.5) L = Nl 

We assume Xq is a random variable uniformly distributed on T with respect to Lebesgue 
measure dxQ. We will denote expectation values by 

(2.6) (...)= [ ...dxo. 



It is easy to work out the expectation value for the number of elements in a random 
interval of size £, 

(2.7) (SM) = L. 

The variance is much less trivial. Let us begin by deriving a convenient representation 
in terms of the pair correlation density. We have for the mean square (the "number 
variance" ) 

(2.8) E^(L) {[SM - L]') = {SM') - 
and 



i,j=l meZ 

4i -^j + m 



i,j=l m£Z 



e 



where 



(2.10) ^(^) = / ~ Xo)x{^o)dxo — max{l — \x\, 0}. 

Now the diagonal terms i = j in the above double sum can be easily evaluated. We 
have 

(2.11) eY.J2Hj)=^^(^)-^ 

i=j=l meZ ^ ^ 

for i<l. 

The pair correlation function (also called two-point correlation function) for the se- 
quence {^j} is defined by 

(2-12) ««(i.^) = ^EE^(^^^T^). 
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where i/j is taken from a class of sufficiently nice test functions (e.g. continuous with 
compact support such as A) . With the above calculation we therefore have the identity 



(2.13) E%{L) = L-L^ + LR%{L,A). 

This says that the asymptotic analysis of the pair correlation density will give us infor- 
mation on the number variance. 

Note that by the Poisson summation formula 

(2.14) ^/(m) = 5^/(n), 
where 

(2.15) f{y)^ f{x)e{xy)dy, e(x) := exp27rix, 
we have 

(2.16) R%iL,^)=^,f: Ev^(f 

Here ip can be any function with absolutely convergent Fourier series (e.g. A). 

2.1. Distribution of gaps. A popular statistical measure is the distribution of gaps 

(2.17) = Ar(e,-+i - Q (j = 1, . . . , iV, ^Tv+i := 6 + 1) 

between consecutive elements (recall the form an ordered sequence on T). We have 
multiplied the actual gap ^j+i — by iV, which means we are measuring spacings in 
units of the average gap 1/N . 

The gap distribution of the sequence Ci? ■ ■ ■ ? Civ is defined as 

1 ^ 

(2.18) p^(,)^_^5(,_,.) 

i=i 

where 5 is a Dirac mass at the origin. The question wc will investigate is whether P/v(s) 
has a limiting distribution P(s). That is, does there exist a probability density P{s) 
such that for every bounded continuous function : R — > R, 

/»oo /»oo 

(2.19) lim / g{s)PN{s)ds ^ / g{s)P{s)ds. 

Jo Jo 

The first question in convergence of probability measures is the problem of tightness. 

Lemma 2.1. The sequence of probability measures {P/v(s)(is} is tight on M. That is, 
for every e > there is a K > such that for all N 

(2.20) [ PN{s)ds < e. 

J\s\>K 
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Proof. We have 

/ PN{s)ds = <N:s^>K} 

J\s\>K 

1 ^ S- 1 ^ s- 

(2.21) ^ ]^I];^X[i^,oo)(s,) <J^J2^ 

j=l j=l 

1 ^ 1 



□ 



Denote by Ep^{k,L) the probabihty of finding k elements in the randomly shifted 
interval [xq^xq + i.e., 

(2.22) EN{k, L) := meas {xq G T : Sn{() = A;} . 

The following theorem explains the relation between P{s) and the probability £'(0, L). 

Theorem 2.2. Given a probability density P{s), the following statements are equivalent. 

(i) Pr,{s) P{s). 

(ii) limjv->oo Ej^{0, L) — E{0, L) for all L > 0, where E{0, L) is defined by 

(-) ^-^L, JI.-(°.^)-. i-^-- 

Proof. We have 

En{0, L) = meas [xo G : #{j : e [xq, + ^) + Z} = O} 



(2.24) 



meas {xq e [^j, Cj+i) : #0' : e [xq, + ^) + Z} = 0} 
E ( ^i+i - - /V ) - ^^O) 

TV N 

L ^ 

^E^[^--)(^(^^+i-^^)) 



= i-^E^(-.) 

where 

(2.25) 5f(x) = max{0,a;,L} 

is a bounded continuous function. 
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"(i)^(ii)." With the above choice of test function (i) imphes 

(2.26) Mm En{0,L) = F{L) -.= 1- sP{s)ds-L P{s)ds. 

Jq Jl 

Now 

dFiL) , , , d^F(L) , , 

(2.27) ^ = '1 P^^^"^- ^ = ^W- 

and 

(2.28) hm F(L) = 1, hm ^^1^ = q. 

"(ii)=^(i)." Since the sequence of probabihty measures Pn{s) is tight, it is relatively 
compact by the Helly-Prokhorov Theorem (also often called Helly's Theorem). That is, 
every subsequence of N contains a convergent subsequence iVj for which PnXs) — P{s) 
as i —>■ oo. This implies (recall the first part of the proof) that En-{0, L) E{0, L) for 
all L > 0. Hence every convergent subsequence has the limit E{0,L), and thus every 
subsequence convergences. □ 

2.2. Independent random variables. In order to understand which statistical be- 
haviour we should expect for the deterministic sequences we will study later, let us 
assume the vector ^ = {^i, ■ ■ ■ ,^n) is a uniformly distributed random vector on T-^ 
with respect to Lebesgue measure d^ = d^i ■ ■ ■ d^jy. (This means the are independent 
uniformly distributed random variables.) We can ignore the issue of ordering the 
here because of the symmetry of the measure dx under permutation of coordinates. 
Expectation values and associated probabilities of a random variable X = X(^) will be 
defined as 



(2.29) EX= / XdC, 

(2.30) Prob(X > R) = meas{^ eT^ : X > R}. 
Theorem 2.3. There is a constant C > such that, for all e > 0, N, L, 

(2.31) Prob(|i?^(L,V^) - L^(0)| > e) < C- ^ 



Proof. First of all, we have for the expectation (the n = term in (j2.16|) ) 

(2.32) erul,^) = ^^^~'^^ m = Lma+oiN-')). 

Secondly, for the variance of Rj^{L, ip), 

(2.33) E\Rl{L,^)-ERlm^)\^ 

E E ^(f )v^(^)E[e(n(6-e,)-n'(e.-e,0)]- 
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Now 

(2.34) E[e{ni^,-Q-n'{^,-^,,))] 
This implies that 

(2.35) E\Rl{L,^)-ERl{L,^)\^ 

The above theorem imphes that for a "generic" choice of the triangular array ()2.ip . 
we have 

(2.36) Rl{L,tP) = Li^{0)+o{l) 

in the limit N ^ oo, £ = L/N ^ 0. This implies for the variance 

(2.37) S^(L)=L + o(L) 

almost surely in the above limit. 

Using standard techniques from probability theory, one can extend these results on 
the variance to the full distribution of a generic realization of the random sequence in 
a small randomly shifted interval. There are two scaling regimes. 

Regime I (Central Limit Theorem): In the limit L ^ oo, N ^ oo, i = L/N — > 
we have 

(2.38) meas Lq G T : '^^ /^^ ~ ^ > r\ ^ ^ H e-'"'Ht 
almost surely. 

Regime II (Poisson Limit Theorem): For L fixed, oo, we have 

(2.39) EM{k,L)^—e-'^. 

K\ 

almost surely. 

3. ma MOD ONE 

We will now consider the statistical properties of the sequence given by the fractional 
parts of ma, m = 1,2,3, .. . for some a. This problem was studied by Berry- Tabor, 
Pandey et al., Bleher, Mazel-Sinai and Greenman using continued fractions (see [7j for 
detailed references). In particular, it is a classical result that there are at most three 
distinct values for the gaps occurring in ma mod 1 which already indicates a rather 
non-generic behavior of the sequence, see e.g. [T3] . 

Here we will use the approach introduced in [7] that has the advantage of avoiding 
continued fractions and thus allowing higher- dimensional generalizations, such as the 
analysis of the distribution of linear forms modulo one. It is also very close to the work 
of Elkies and McMullen on ^/m mod 1 which we will discuss in the next section. 



1 ii n = n', i = i' , j = j' 

or if = —n', i = j', j = i' 
otherwise. 



i;o(ivvi)=o(A). 



□ 
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We will be interested in the regime where L = Ni is fixed (Poisson scaling regime). 
The number (j2.3|) of elements in an interval of size i and centered at Xq is then 



m=l neZ ^ 

(3.1) = X{o,i]l^jX[-L/2,L/2]{N{ma + n-xo)) 



{ma + n — xq) 

1j 



(m,n)GZ2 



where xi denotes the characteristic function of the interval / C M and 

(3.2) ipix, y) = X(o,i] {x)xi-L/2,L/2] (y) 

is the characteristic function of a rectangle. 

Define the Lie Group G by the semidirect product SL(2,M) k with multiplication 
law 

(3.3) (M,0(M',O = (MM',eM' + 0, 

where ^, ^' G are viewed as row vectors. This group has the matrix representation 



(3.4) (M,0 



The function 



(f ?) e SL(3. 



(3.5) f(M,?)= J] '(;)(mAf + f) 

defines a function on G. Note that, with as above, the sum in ()3.5|) is always finite, 
and hence F is a piecewise constant function. Furthermore, 

(3.6) SM = F{M,0 
for the special choice 

(3.7) M=(J l)(T n)^ e=(0,-xo)M. 

The crucial observation is now that F is left-invariant under the discrete subgroup 
r = SL(2, M) X R^, and hence F may be viewed as a piecewise constant function on the 
homogeneous space T\G. 

Proposition 3.1. F{^g) = F{g) for all 7 G F. 
Proof. We have the decomposition 

(3.8) ^ = (^,n) = (7,0)(l,n) 
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for some 7 G SL(2,Z), n G Z^. It is therefore sufficient to check the statement for 
elements of the form (7,0) and (l,n) separately. We have 

F((l,n)(M,0) = i^(M,nM + 

= ^ i){{m + n)M + C) 

= F(M,0 

which proves one case, and 

F((7,0)(M,O) = i^(7M,O 



(3.9) 



(3.10) 



J2 H^lM + 
J2 + 



= F{M,0 

since 7Z2 = Z^. □ 

Alternatively, F may be expressed as 
(3.11) Fig)= ^i<19)) 

7G7r(r)\r 

with the projection 

(3 12) tt: G - 

From (j3.1ip the invariance under F is directly evident. 

3.1. Geometry of F\G. The aim is to find a good coordinate system for G. Since 
parametrizing is obvious, we need to mainly worry about SL(2,M). The Iwasawa 
decomposition of an element M G SL(2, M) is 

'1 u\ \ / cos(0/2) sin(0/2)\ 

1 M ^"'^V V- sin(0/2) cos(0/2) ) 



(3.13) M 



where r = u + iv G EI := {r G C : Imr > 0} (the complex upper halfplane) and 
G [0,47r). This yields a 1-1 map SL(2,M) EI x [0,47r). Left-multiplication becomes 
now an action of SL(2, M) on EI x [0, 47r) given by the formula 

(3.14) (! ';'l = (5I±4,^-2arg(cr + d) 



c dj ycT + d 

(this can be checked by a straightforward calculation). The fractional linear transfor- 
mation of the T component defines an (orientation preserving) isometry with respect to 
the Riemannian line element 

(3.15) ds^ = ^-^^^ 

V 
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Figure 1. Fundamental domain of the modular group SL(2,Z) in the 
complex upper half plane. 



and the transformation property of (f) is identical to the direction of a tangent vec- 
tor at r e H. Thus the group PSL(2,R) := SL(2,M)/{±1} ~ H x [0, 27r) can be 
identified with the unit tangent bundle H of H. Similarly, SL(2, Z)\ SL(2, M) ~ 
PSL(2, Z)\ PSL(2, M) can be identified with the unit tangent bundle of the modular 
surface SL(2, Z)\]HI. A fundamental domain for the action of SL(2, Z) on EI is shown 
in Figure n We have 

T = {t eU: \r\> 1, I Rer| < 1/2} 

(3.16) U {r G e : |r| > 1, Rer = -1/2} 

U {r G e : |r| = 1, -1/2 < Rer < 0}. 

Note that the modular surface is not compact, there is one cusp at ioo. It has however 
finite measure with respect to the Riemannian volume v~'^dudv. 
In order to understand the geometry of all of T\G, write 

(3.17) ^?=(l,O(M,0) 

which gives a particular parametrization in terms of and SL(2,]R). Since F contains 
the subgroup 1 k Z^, ^ can be parametrized by = Z^\]R^ ~ [0, 1)^. This concludes 
our analysis: we have found a 1-1 parametrization of G in terms of 

(3.18) T^(SL(2, M)\H) x ~ J^^ x [0, 2n) x [0, 1)^. 

That is, F\G is a (non-trivial) bundle over T^(SL(2, Z)\e) with fibre T^. 

3.2. Dynamics on F\G. Consider the one-parameter subgroup := {$*}feM where 



(3.19) 
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defines a flow on r\G by right multiplication, 

(3.20) ^ rc/$*. 

The remarkable observation is that our object of interest, S^li), is related to a function 
F on r\G evaluated along an orbit of this flow: 

(3.21) SNii) = F((7o$*) 
with t = 2 log and initial condition 

(3.22) 90 =(^(1 "),(0,-xo; 
Let us define 

(3.23) n_ia,y)= (^(^J "),(0,y) 

The subgroup H = {n^{a,y)}(^a,y)m^ is abelian and isomorphic to M^. Notice that 

(3.24) TnH = {n.{a,y)}^o.,y)ez^ 

is a subgroup of H isomorphic to Z^. Therefore, for every fixed t, the set 

(3.25) T\TH^^ 

describes a torus ~ embedded in r\G; t parametrizes a continuous family of such 
tori. 

We will now show that H parametrizes the unstable directions of the flow $*. We 
employ the following parametrization of G. Write 

(3.26) g = n_{a,y)^'n^{(3,x), 
where 

(3.27) n+(/?,x)= (^(^^ fj,{x,0) 

We will write for short g = {a,y, s, f3,x). The advantage of these coordinates is that 
the time evolution under can be worked out very simply. We have the relation 

(3.28) (a, y, s, p, x)¥ = <l>*(e*a, e^^^y, s, e"*/?, e'^/^x). 

Distances on r\G are measured by a left-G-invariant (since F acts on the left) Riemann- 
ian metric d{g,g') on G. If g = (a,?/, 0,0,0) and g' = {a' ,y' ,0,0,0) are two initially 
close points, we have under the flow $* (use the above formula and left-invariance of 
the metric) 

d{g^\ g'^') = d{{e\a - a'),e"\y - y'), 0, 0, 0), (0, 0, 0, 0, 0)) 

■ ^{e^'\a-af + e'\y-y'\y\ 

Hence (a, y) describe exponentially unstable directions of the flow, and by the same 
argument it is easy to see that (/3, y) are the exponentially stable directions and s is of 
course the neutral flow direction. In particular we have the bound 

(3.30) d{{a, y, s, p, x)^\ {a, y, 0, 0, 0)$*) = 0{\s\ + |/3|e-* + \x\e~^/^) 

for s,(3,x bounded and t> 0. This follows directly from (j3.28|) . 
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3.3. Mixing and uniform distribution. Recall that we are interested in the be- 
haviour of the distribution of Sn{£) for xq random and N large. At this point it will 
be convenient to also take a to be random, say, uniformly distributed in the interval 
[a,b]. We will see later that for fixed a there is no universal limiting distribution (an 
observation that is well known and related to the three gap theorem ^3]). 

We will use equidistribution on r\G to prove the following limit theorem, which 
asserts a limiting distribution different from Poissonian, cf. ()2.39p . 

We will use the notation g = Tg. 

Theorem 3.2. For any L > 0, 

(3.31) lim meas{(a, a;o) G [a, b] x [0, 1] : SN{i) = k} = E{k, L), 
N^oo — a 

where 

(3.32) Eik, L) = -j;^^fi9 e r\G : F{g) = k). 

Here F is the function defined in (|3.5|) . and /i the Haar measure on G. An explicit 
formula for ci/i in the Iwasawa coordinates is 

(3.33) djji = — - — d(j)dxdy. 

It is possible to derive more explicit formulas for E{k, L) from ()3.32|) . but this requires 
some involved calculations which we will not pursue her. See |TH], Section 8, for details. 
The key to the proof is the following equidistribution theorem. 

Theorem 3.3. For any bounded, piecewise continuous^ f : T\G M 

(3.34) lim-^/" [ f{n4a,y)^')dady = —^—[ fdfx. 
t^ocb-a Jo ^(r\G) ir\G 

Proof. It is well known that the fiow is mixing,'^ that is for any f,hE L^(r\G') 

(3.35) hm / fig¥)h{g)df, = — i— / fdf^ [ hd^,. 

Jr\G \G) Jt\g Jr\G 

Take / to be continuous an of compact support, and h the characteristic function of the 

set 

(3.36) S, = r{(a, y, s,(3,y):ae [a, b], y e [0, 1], s, /3, x G [-e, e]}, 

which forms an e-neighbourhood of the embedded closed torus 5*0. By the uniform 
continuity of / and ()3.30|) . given any 6 > there is an e > such that 

(3.37) snp\f{g^')-f{n.{a,y)^')\<5. 

i>0 

Haar measure in the local coordinates {a,y, s, P,y) reads (up to normalization) 

(3.38) dfi = e^^^'^ds da df5 dx dy. 
^i.e. the discontinuities are contained in a set of measure zero. 

"^This is guaranteed by a general theorem by Moore for semisimple Lie groups, which can be extended 
to the non-semisimple G considered here, cf. jH]. 



DISTRIBUTION MODULO ONE AND RATNER'S THEOREM 



13 



We conclude that 

(3.39) liminf-^ / f f{n_{a,y)¥)dady = I fdfx + 0i5) 



'a Jo 

and 



(3.40) limsup-^ / I f{n4a,y)^')dady=—-— I /rf/i + 0(5), 
b-aj^ Jq /i(r\G) Jr\G 



where the imphed constants are independent of e. This works for any 6 > 0, and hence 
the hmit must exist and equal ^^p*^^^ Jp^^ fdfi. 

To extend the statement of the theorem to bounded continuous functions, we observe 
that it holds (trivially) for constant /, and therefore also for continuous functions / 
that are constant outside some compact set. 

Let / be a bounded piecewise continuous function. Given any e > we can find 
continuous functions f±, constant outside some constant set, such that 

(3.41) /-</</+ 
and 



This implies 
(3.43) 

liminf / / f{n^{a,y)^*)dady >\imin{ / / f^{n^{a,y)^*)dady 

t^oo b-aj^ Jo t-.oo b-aJa Jo 

f-dfj, 



> J: / fdfi - 2e. 
fi{T\G) Jr\G 

The analogous argument shows 

(3.44) limsup-^ / f f{n^{a,y)¥)dady < f fdfx + 2e. 

t-^oo b-a J^ Jq /i(r\G') Jr\G 

Taking e > arbitrarily small proves the theorem. □ 

Remark 3.1. An alternative proof of Theorem 13.31 follows from Ratner's theorem, since 
the subgroup {n_{a,y)}a^y^R is generated by unipotent elements. We will get back to 
this later. 

Proof of Theorem VJ.iA Apply Theorem 13.31 to the characteristic function of the set of 
'g G T\G for which F{g) = k (to make sure the characteristic function is piecewise 
continuous, check that the set has a boundary of /i measure zero). □ 

Remark 3.2. As we had mentioned earlier, there is no limiting distribution as in Theorem 
13.21 if a is fixed, since there is no analog of the equi distribution result. Theorem 13.31 
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One can show, however, that if a is irrational we have for any continuous, compactly 
supported function 

(3.45) Jj{n-{c^,yW)dy = 7 (^(l i) (%'^' J/^)) ^ ""^^^ ^ 

where / is a (non-constant!) continuous, compactly supported function on SL(2, Z)\ SL(2, 
defined by 

(3.46) 7(M)= [ /((l,O(M,0))cie 

Remark 3.3. If one however fixes y = —Xq ^ Q and keeps a random, Ratner's Theorem 
implies the following equidistribution result. For any bounded piecewise continuous 
/ : T\G R 

(3.47) hm / /(n_(a,|/)$*)da = — i— [ fd^i. 

Hence the limiting distribution is universal (i.e. independent of y as long as y is irra- 
tional) and the same as for random y. Thus the probability of finding k points in the 
interval Xq — £/2, xq + £/2) with fixed center xq ^ Q has the limiting distribution 

(3.48) lim meas{a G : Sn{(.) = k} = E{k, L), 
the same as for random center. We will prove ()3.3|) in Section El 



4. y/ma MOD ONE 



The problem of the statistics of y/rna mod 1 has been understood by Elkies and 
McMuUen j2] in the case a = 1 (and in principle also for all other rational a). The 
uniform distribution of y/ma mod 1 may be shown by using the fact that a/ti + m — 
^/n for n — > oo, m fixed (we leave this as an exercise). As in the last section, 
the key idea is the reduce the problem to equidistribution on a homogeneous space. 
Lucky for us, this homogeneous space will turn out to be r\G with the same G, T as 
encountered earlier. 

We are as in the previous section interested in the "Poisson scaling limit", i.e. L is 
fixed. Now (we swap m and n in our notation) 

(4.1) SM = J2J2^( -Xo + m)\. 

n=l mGZ ^ ^ 

The condition imposed on the summation can be re-written as 

lV ( 



(4.2) ^ ~ ~ 2iV j - < ( ^0 ~ + ^ 



which amounts to 



(4.3) -j^{xo-m) <na- {xQ-m) -[j^j <J^{^o~^)- 
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Notice also that 

(4.4) < A. 
This yields 

(4.5) S«W= E X.„,„( 

( N^/^[na - (xq - mf - {L/2Nf] 
^["^'^^ V N-^l'^{xQ-m) 
A more convenient object would be 

(4.6) S,M()= J2 N-y^ixo-m) J' 

(m,n)6Z2 \Vivw/ \ VU 7 / 

For the right choices of e (positive/negative) we obtain upper/lower bounds for Sn{^) 
which would eventually allow us to infer the hmiting distribution of Sn{(^) from SN,e,s{£) 
by taking 5^0, e — ±0. We will ignore this technical point here and simply take 

o //,\ 7^ /n\ sr^ fxQ~m\ f N^/'^\na — [xq — mY]\ 

(4.7) 3,(0 ^ s.Mi) - E n.-^) )■ 

(m,n)eZ2 VVivw/ \ VU ;/ 

The manipulations we will now perform on the r.h.s. of ()4.7p can be adapted step 
by step for more general values of 5, e 7^ (recommended exercise). We will use the 
shorthand Sj\f{£) := SNfifi{£) in the following. 

4.1. The case a = 1. We have, after substituting {m,n) {—m, —n), 
4.8 SN{i) = > X{o,i] — 7=f- X{-L,L] — TTTTTJT — I ^ ' 

an thus, after substituting n ^ n + m? in the sum over n. 

We will now show that, in analogy with the previous section, we can find a function 
F : r\G ^ M of the form 

(4.10) F(M,0= 5^ ^'(mM + O 



mfc 



SO that 

(4.11) SN{t)=F{g) 

for a suitable choice of (7 G G and a piecewise continuous : — >■ M with compact 
support. To this end define 

(4.12) ^(x,?/) = X{o,i]{x)X{-L,L] (^) 
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(which indeed has compact support: it is the characteristic function of a triangle). Now 
consider the one parameter subgroup {ni{x)}x:£R with 

(4.13) ^^(^)=((J 
(check that this indeed yields a one parameter group). Then the choice (set t = logN, 

X = Xo) 

(M,0 = rii(xo)$* 

(4.14) ^ |^|^iV-V2 2xoiVV^ 

yields 

(4.15) (m, n)M + {N-^^^{xo + m), N^/^{2mxo + n + xD) . 

Using this result in the definition ()4.10j) then confirms the desired fj4.1H) . 

We now follow the same steps as in the previous Section El to derive the limiting 
distribution for Si\f{£) from equidistribution on r\G. We first state the limit theorem. 

Theorem 4.1. For any L > 0, 

(4.16) lim meas{xo E T : SN{i) = k} = E{k, L), 

AT— >-oo 

where 

(4.17) Eik, L) = -j^Mg e T\G : F{g) = k), 



with F as defined in ()4.10|1 . 

An explicit formula for E{0, L) and the corresponding gap distribution P{s) (recall 
Theorem 12. 2j) is worked out in 

The relevant equidistribution theorem needed to prove Theorem 14.11 is the following. 
Note that F fl {ni{x)}xeR = {ni{x)}xez and hence 

(4.18) r{ni(x)},eT$* 

represents a family (parametrized by t) of closed orbits embedded in r\G. 
Theorem 4.2. For any bounded piecewise continuous f : r\G R 

(4.19) hm / /(ni(x)$*)rfx = — i— / fd^x. 

Since ni{x) generates a unipotent fiow, Ratner's theorem can be employed. We will 
explain the general strategy of proof for statements of this type in Section El 

4.2. Some heuristics in the case a = We return to generic a, such as a = -\/2, 
and rewrite Sjqif) as 

(4.20) SM= J2 Xm[-^ Xi-LM[ M-^xo + m) J 
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where M — y/Na. For xq G [0, 1] we can ignore terms of the form xq/M, 
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(4.21) S^{1)^ X(o,i] (^) X(-L,L] 



(m,n)GZ2 



Now note that for most values of m, we have m/M x 1, and it is natural to assume 
that, for random xq, the probabihty of finding k elements of the set 

(4.22) {a-\xo + m)^ ■.m = l,...,M} + Z 

in an interval of size 1/M around the origin is given by the Poisson distribution (we 
must assume here that a is badly approximable by rationals, e.g. a = \f2 would be a 
good choice). Hence we may assert that the limiting distribution of Sjsiip) is the same 
as that of the random variable 



(4.23) 



+ n] 



where r]m are independent uniformly distributed random variables on [—1/2, 1/2). With 
this choice of interval the only contribution comes from the n — Q term (assume M ^ L), 
so 



M 



(4.24) 

where 
(4.25) 



m=l 



— X{-L,L] 



m 



is a sequence of independent random variables with kih. moment 



(4.26) 

and hence 
(4.27) 



ex: 



1/2 



X{-L,L] 



1/2 



E(e'*^™ - 1) 



m 



2Lm 



drir, 



e"-l). 



2Lm 



The characteristic function of the random variable X is therefore 



Ee^ 



itX 



M 

n 

m=l 



1 + 



2Lm 



1) 



(4.28) 



M 



exp 



= exp 



m=l 
M 

E 

m=l 

L(e'* 



2Lm, -.f ^, 
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alpha:=sqrt(2) ; N:=6001; 

N 6001 

L:=sort( [seqCevalf [12] (frac(sqrt(n*alpha))) , n=l. .N)]) : 
alist:=seq(evalf [12] (N*(L[i+l] -L[i] )) ,i=l. .N-1) : 

data: =stats [transform, tally into [' outliers ' ] ] ( [alist] , [seq( (i-1) *0 . 2 . 
i*0.2,i=0. .35)]) : 

outliers ; 



[7.0547245915, 7.0674227075, 7.1105849000, 7.1693268887, 7.2093775627, 
7.3219323187, 7.3381866273, 7.4195061783, 7.5000233956, 7.6451419780, 
7.7497418084, 7.9388213164, 8.0221013941, 8.1512135092, 8.4582030656] 

> datal : =stats [transform, scaleweight [l/nops( [alist] ) ] ] (data) : 

> gl : =stats [statplots , histogram] (datal) : 

> g2:=plot(exp(-s) , s=0..6): 

> plots [display] (gl,g2) ; 




Figure 2. Maple worksheet for calculating the gap distribution of the 
^mV2, m = 1,...,6001. 



fractional parts of 



The expression e^'^'^'* is the characteristic function of the Poisson law 



(4.29) 



Hence this should be our prediction for the limiting distribution of S]\f{i), which in turn 
implies that we expect the exponential distribution for gaps in ^ma mod 1. This is in 
good agreement with our Maple experiment, Figure |21 
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5. RATNER'S THEOREM 

An excellent introduction to Ratner's theory is Dave W. Morris' recent textbook ^T] . 
Let G be a Lie group (e.g. SL(2, M) x M^) and F be a discrete subgroup (e.g. SL(2, Z) x 
Z^). It is at this point not necessary to assume that F is a lattice in G, i.e., that F\G 
has finite volume with respect to Haar measure /i on G. Ratner's measure classification 
theorem gives a complete geometric description of all measures that are invariant and 
ergodic under the a unipotent one parameter subgroup U (or, more generally, invariant 
and ergodic under a subgroup generated by unipotent subgroups) acting on F\G by 
right multiplication. Examples of unipotent subgroups that appeared in the previous 
sections are {n_{a,0)}am, {n-{0,y)}am and {ni(x)}^.g]R. 

Theorem 5.1 (Ratner's theorem). Let v he an ergodic, U -invariant probability measure 
on T\G. Then there is a closed, connected subgroup H C G, and a point g G F\G such 
that 

(1) u is H -invariant, 

(2) u is supported on the orbit gH . 

Remark 5.1. Let (7 G G be a representative of the coset ^ = Tg, and define the subgroup 
Th = {g~^Tg) fl H. Then the orbit gH may be identified with the homogeneous space 
Th\H and u with the Haar measure on H. Furthermore one can deduce (since z/ is a 
probability measure) that Th is a lattice in H, i.e., v{gH) < 00, and that the orbit 'gH 
is closed in F\G. 

In simple words, measures u invariant and ergodic under unipotent subgroups are 
supported on nice embedded closed subvarieties, of which there can be only countably 
many (modulo translations of course). We will now discuss two corollaries of Ratner's 
theorem that are relevant to the equidistribution theorems discussed earlier. 

5.1. Limit distributions of translates. The following is special case of Shah's ex- 
tremely useful theorem. Theorem 1.4 in [14j. 

Theorem 5.2. Suppose G contains a Lie subgroup H isomorphic to SL(2, M) (we denote 
the corresponding embedding by (p : SL(2,M) G), such that the set T\TH is dense 
in V\G. Then, for any bounded, piecewise continuous f : F\G M and any piecewise 
continuous h : M. M. with compact support 

where fi is the Haar measure of G. 

The general strategy of proof for statements of the above type is as follows. 

(1) Normalize h such that it defines a probability density. 

(2) Show that the sequences of probability measures ut defined by 

(5.2) "'(^'"//KG l)C'o" e'/^)))'"W^- 

is tight. Then, by the Helly-Prokhorov theorem, it is relatively compact, i.e., 
every sequence of Ut contains a convergent subsequence with weak limit u, say. 
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(3) Show that v is invariant under a unipotent subgroup \J\ in the present case, 
(5.3) V=U 



1 X 
1 



(4) Use a density argument to rule out measures concentrated on subvarieties (ex- 
ploit the assumption that Y\Tli is dense in Y\G). 

As an application of Shah's theorem we give a proof of the statement in Remark 13.31 
in fact a slightly more general version allowing for non-constant h. Recall that here 
G = SL(2, R) X M2 and T = SL(2, Z) x Z^. 

Corollary 5.3. Let y ^ Q. For any bounded piecewise continuous f : r\G M and 
piecewise continuous h : M. —>■ R with compact support 

(5.4) lim / f{n_{a,y)<l>')h{a)da= ^ [ fdfi [ h{a)da. 
*^°°JR K^\G) Jt\g Jr 

Proof. We define the embedding ip : SL(2,M) — G by 

(5.5) M^(l,(0,y))(M,0)(l,(0,i/))-\ 
We need to show that 

(5.6) (7,n)(l,(0,y))(M,0)(l,(0,y))-^ 

is dense in G as 'j,n,M vary over SL(2,Z), Z^, SL(2,R), respectively. It is obviously 
sufficient to show this for 

(5.7) (7, n)(l, (0, y)){M, 0) = (7M, (m, {y + n^))M), 

and thus for (M, (ni, {y + n2))7~^M). It is however easy to see, using the irrationality 
of y, that (ni, [y + n2))7~^ is dense in R? (exercise). The completes the proof of the 
density. 

Shah's theorem says now that 

(5.8) lim / /(n_(a, y)¥n_{{], y)-^)h{a)da = ^ / fdfi [ h{a)da. 
*-*°°jR K^\G) Jr\G Jr 

for all bounded, piecewise continuous /. Choosing the test function 

(5.9) f{g) = fign-{o,y)) 

which is left-F-invariant and bounded, piecewise continuous, if / is (as assumed). This 
yields (Q- □ 

5.2. Equidistribution, unbounded test functions and diophantine conditions. 

In some applications of Ratner's theorem, e.g., in questions of value distribution of 
quadratic forms 131111111^1, the test functions / in the equidistribution theorems are no 
longer bounded. Under such circumstances the convergence of the integral can only be 
assured by assuming certain diophantine conditions. Without going into the intricate 
details for general T\G, we will illustrate this phenomenon in the distribution of ma on 
T, which indeed may be viewed as a unipotent orbit on the homogeneous space Z\R. As 
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mentioned earlier, it is well known that for a ^ Q the sequence is uniformly distributed 
mod 1. That is, for any bounded continuous function / : T ^ M we have 

1 ^ f 
(5.10) lim — V f{ma) = / f{x)dx. 

N^co JS ^ J J 



m=l 



Let us know formulate the analogous statement for test functions with a possible sin- 
gularity at X = 0. 

It is convenient to identify T with [—1/2,1/2). For any /3 > we define the class 
KpiT) of functions continuous on T — {0}, with the property that there is a constant 
C > such that 

(5.11) \f{x)\<C\x\~^, for all a; G [-1/2,1/2). 

We say a G M is diophantine of type k, if there exists a constant c > such that 



P 

a 

Q 



c 

> — 



for all p, g G Z, g > 0. The smallest possible value of k is k = 2 (achieved for quadratic 
surds, e.g., a = a/2), and it is well known that for any k > 2 there is a set of full 
Lebesgue measure of a that have type k. 

Theorem 5.4. Let a he diophantine of type n. Then, for any f G -ft'/3(T) with < P < 
1/(k-1), 



1 ^ r 

(5.12) lim — V f{ma) = / f{x)dx. 

m=l ^ 



Proof. We split f = f^ — f^ into positive and negative part, such that /± > 0. Then 
f± G Kp{T) and we may prove ()5.12j) for both f± separately. In the following we will 
thus assume that / > 0. 
For any e > let us define 



f{x) if > e 

min{/(x), /(e)} if \x\ < e 

and Qe = f — fe- Note that < /. By assumption, 

< C\x\-f^ if |x| < e 
: if \x\ > e. 



(5.13) /,(x) 



(5.14) g^ix) 



The function is bounded continuous, and hence by uniform distribution 

N 

(5.15) lim lV/e(ma)= / /e(x)dx = / f{x)dx-0{e'-^). 
Since < /, this implies the lower bound 

1 ^ r 

(5.16) liminf — V f{ma) > / f{x)dx - 0(e^-^). 

m=l 
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As to the upper bound, 
(5.17) -}_^g,ima) < ^ 2^ 



m=l ?Ti=l " " 

where || ■ || denotes the distance to the nearest integer. Using the dyadic decomposition 
of the unit interval, we find 

^ X(0,e](||"^«||) I X{e2-U + ^),e2-J]{\\ma\\) 



1 X{o,e]{\\ma\\) ^]_Y^y^ 
AT 2^ llmn-P AT 2^ 2^ 



m=l " " i=0 m=\ " " 

N 



(5.18) 



j=0 m=l 



oo 



< ^ ^ 2^(^+1) (e2-(-'+i))^ (for some 5 > 0) 
^ i=o 

= 0(e^-^). 

The inequahty before the last follows from Lemma [5.51 below. We conclude that 

1 ^ 

(5.19) -Y^g^{ma) = 0{e-^--^) 

m=l 

Therefore 

1 ^ 1 ^ 

limsup — / (ma) = limsup — ^^[/^(ma) + fi'e(ma)] 

(5.20) ^-^1 ^'^i 

< / /(x)rfx + 0(e^-'^) + 0(e^-''), 

in view of and (jOTI|l . 

Since e > can be arbitrarily small, the limsup and liminf must coincide. □ 

The following lemma is used in the preceding proof. 

Lemma 5.5. Let a he diophantine of type k. Then there is a constant B > such that, 
for any interval [xq, xq + i], 



(5.21) #{m = 1, . . . , AT : ma G [xq, Xq + i] + Z} < 



zfN^'^HKc 
BNf.^l^''-^'' otherwise. 



Proof. Define T = l/£. Let us divide the counting into blocks of the form 

(5.22) #{mo < m < mo + T^'^''-^^ : ma G [xq, Xq + ^] + Z}, 

The number of such blocks contributing to (j5.2H) is less than 0{NT^^^^'^~^^ + 1). 

The gaps between elements of the sequence ma mod 1, mo < m < mo + T^l^'^^^\ are 
of the form na mod 1, with \n\ < 2T^^^'^~^\ By the diophantine condition, the gaps 
therefore have seize at least 

(5.23) ||na|| > > 
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An interval of size £ — 1/T can hence at most contain a bounded number of elements. 

Hence 

(5.24) #{mo < m < mo + T^/^""^) : ma e [xo, xq + i] + Z} < B' 

for some constant 5' > independent of mo,xo,£. Recall that there were at most 
jVT~^/('*~^) + 1 such blocks, and this yields the upper bound in the second alternative. 

The first alternative is easily proven since the minimum gap size for the full sequence 
m = 1,...,A^ is at least c/(2A^)''-^ □ 
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